-
Notifications
You must be signed in to change notification settings - Fork 40
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
New Classification Model for Fault Detection in 3W Dataset #64
Open
yantavares
wants to merge
22
commits into
petrobras:main
Choose a base branch
from
yantavares:contrib
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
…amples parameter and update alpha value
Refactor calculate_tipping_point method to include is_validation and use_val_limit parameters Refactor test method to include use_val_limit parameter Refactor plot_SPE method to include is_validation and use_val_limit parameters Refactor test_pipeline method to include use_val_limit parameter
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR was made by the team Black Oil Pyrates in HACKTUDO.
Description:
This PR introduces a new
ClassificationModel
class to the repository, which enables the training and evaluation of custom machine learning models to classify faults in the 3W dataset. TheClassificationModel
allows for flexible model integration, supporting different algorithms for the identification and classification of class faults.Code Overview:
ClassificationModel
class:model
: Allows the user to specify a custom classification model (e.g., Random Forest).scaler
: AStandardScaler
instance used for feature normalization.load_and_prepare_data
: Loads and preprocesses data from parquet files, handling timestamp cleaning and feature extraction.train_model
: Trains the provided machine learning model on the training dataset.predict_single_file
: Makes predictions on a single file, returning actual and predicted class labels along with the timestamps.plot_predictions_with_timestamp
: Visualizes the time-series predictions alongside actual class values using Plotly.complete_analysis
: Automates the entire process from data loading to prediction and visualization.compare_models
: Compares multiple models, training and testing them on different files, and evaluates their performance using accuracy and F1-score.Example Usage:
Dependencies:
This PR adds a flexible and scalable way to train machine learning models for fault classification in the 3W dataset, with support for custom models like Random Forest and interactive plotting capabilities.